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@ Multiprocessor buffer system. 

@ The present invention is directed to a buffer 
swapping scheme to communicate a message 
from a first device to a second device wherein a 
pointer to a free buffer is returned to the first 
device by the second device as a condition for 
the first device to pass a pointer to a buffer 
containing a message intended for the second 
device. 
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Field of the Invention 

The present invention is directed to a multipro- 
cessor computer system and, more particularly, to a 
scheme for communicating messages between the 5 
processors of the multiprocessor computer system. 

Background of the Invention 

Modern computer systems are often implement- io 
ed as multiprocessor systems wherein a plurality of 
processors are coupled to one another by. e.g.. a bus. 
In the operation of the computer system, it is often 
necessary for the processors to communicate mes- 
sages to one another. In one known scheme, each is 
message is stored in a buffer by the processor gen- 
erating the message (the source processor). The 
source processor then passes a pointer to the buffer 
to the processor to which the message is directed 
(the destination processor) and the destination proc- 20 
essor utilizes the pointer to read the message from 
the buffer. As should be understood, each processor 
can be either a source or destination of messages. 

The buffer pointer passing scheme prevents tem- 
porary input/output bandwidth crunches at any par- 25 
ticular destination processor, as might occur, e.g., if 
each entire message were directly transmitted to the 
destination processor and several source processors 
need to forward messages to the particular destina- 
tion processor during the same time period. The pass- 30 
ing of pointers permits each destination processor to 
schedule the movement of messages from the appro- 
priate buffers to the respective destination processor. 
A particular destination processor can readily receive 
and store pointers and access the associated buffers 35 
for the full messages over a time period consistent 
with the input/output bandwidth of the particular des- 
tination processor. 

The buffers used to communicate messages can 
all be arranged in a shared central memory, coupled 40 
to the bus. The use of a central memory simplifies the 
overall design requirements for the buffer system and 
provides flexibility in the allocation of buffers be- 
tween processors. At initialization of the system, each 
source processor is allocated a number of buffers 45 
within the central memory consistent with the amount 
of message traffic expected from that source. In ad- 
dition, the shared memory can be implemented with 
an input/output bandwidth at, e.g.. twice the peak re- 
ceive rate for messages so that the central memory so 
can handle the higheat possible amount of full mes- 
sage traffic generated throughout the system. In this 
manner, there are less rigorous bandwidth require- 
ments at each potential destination processor, which, 
as indicated, need only receive pointers to the buffers ss 
storing the full messages and operate to schedule the 
transfer of the full messages at a rate within the in- 
put/output bandwidth of the particular destination 



processor. 

While the known buffer pointer approach to mul- 
tiprocessor communication provides an advanta- 
geous scheme for effectively passing messages be- 
tween processors, the scheme does not adequately 
provide for an assured supply of free buffers for each 
source processor through a reliable return of free buf- 
fers by destination processors. This is particularly 
true in multiprocessor systems with heavy message 
traffic. 

Summary of the Invention 

The present invention provides a message com- 
munication scheme that assures an orderly and guar- 
anteed return of free buffers to a source processor as 
part of the message forwarding protocol. Generally, 
the present invention comprises a buffer swapping 
scheme wherein a pointer to a free buffer is returned 
to a source processor by a destination processor as 
a condition for the source processor to pass a pointer 
to a buffer containing a message directed to the des- 
tination processor. Thus, the consumer of buffers, the 
destination processor, is made responsible for main- 
taining an adequatesupply of the buffer resource. Ac- 
cording to a feature of the present invention, the buf- 
fer swapping scheme is implemented In a manner to 
avoid the need to synchronize the source and destin- 
ation processors for the exchange of buffer pointers 
or the need to synchronize source processors trans- 
ferring messages to a common destination processor. 

To that end, the destination processor maintains 
a pool of pointers to free buffers and the source proc- 
essor perform an independent sequence of steps, 
during the transfer of a pointer to the destination proc- 
essor, to acquire one of the free buffers before trans- 
ferring the pointer to the buffer containing the mes- 
sage for the destination processor. Each destination 
processor is arranged to maintain a set of buffer ring 
memories, one dedicated to each potential source 
processor, to link each potential destination to all po- 
tential sources of messages. The ring memories for a 
particular destination processor can be implemented 
as a local memory at the destination processor. In the 
alternative, all of the ring memories can be imple- 
mented in a centralized memory resource accessible 
by all of the source and destination processors for the 
exchange of buffer pointers. 

Each buffer ring memory includes a series of lo- 
cations for the storage of pointers to buffers that con- 
tain messages from the respective source that are di- 
rected to the destination processor associated with 
the buffer ring. The use of dedicated ring memories 
removes the need to synchronize source processors 
transferring messages to a common destination proc- 
essor. As described, each source processor has a de- 
fined, dedicated memory space within its buffer ring 
memory for the storage of buffer pointers and, thus, 
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can read and write to that memory space exclusive of 
other potential source processors. The use of a com- 
mon pointer memory space for a particular destina- 
tion processor that is shared by all source processors 
transferring messages to the particular destination 
processor would require atomic or interlocked bus 
transactions such as a read-modif y-write interlocked 
bus transactions, to assure completion of a buffer ex- 
change by one of the source processors. The atomic 
bus transactions would be needed to preclude access 
to the common memory apace by another source 
processor prior to completion of the current buffer 
pointer exchange. 

A suitable indication, such as, e.g., an "owner- 
ship" bit is arranged at each location of a buffer ring 
memory to mark the location as valid/not valid in re- 
spect of each of the source and destination proces- 
sors linked by the particular buffer ring memory. The 
ownership bit is used to communicate to a processor 
reading the location whether the pointer stored at the 
location is "owned" by that processor. In general, a lo- 
cation owned by the source processor corresponding 
to the buffer ring memory contains a pointer to a free 
buffer that is avaaable for transfer back to the source 
processor. On the other hand, a location owned by 
the destination processor corresponding to the buffer 
ring, contains a pointer to a buffer storing a message 
from the respective source processor that is intended 
for the destination processor. 

A message transfer, via the transfer of a pointer 
to the buffer where the message is stored, can be 
performed in two bus transactions between any 
source processor and any destination processor. A 
source processor having a message to send to a par- 
ticular destination processor will initially store the 
message in a free buffer in the central shared mem- 
ory. During a first bus transaction, the source proces- 
sor reads a next location of its buffer ring memory as- 
sociated with the intended destination processor If 
that location is owned by the source processor, the 
source processor reads the pointer stored therein for 
use as a free buffer in a subsequent message trans- 
fer. 

The source processor, during the second bus 
transaction, then writes, into the location of the buffer 
ring memory from which it just read the pointer to the 
free buffer, the pointer to the buffer of the central 
memory where it stored the message for the particu- 
lar destination processor. The source processor will 
also change the state of the ownership bit to indicate 
that the location is now owned by the respective des- 
tination processor to indicate that the location stores 
a pointer to a buffer containing a message for that 
destination. Thus, the source processor acquires a 
free buffer as part and parcel of the transfer of a mes- 
sage via independent bus transactbns to access its 
buffer ring memory. 

Each destination processor is operated to poll the 



locations of the local buffer ring memories for owned 
locations, i.e., locations containing pointers to buffers 
that have messages for the respective destination 
processor. When the destination processor locates 

5 an owned location, it reads the pointer value stored 
therein and then overwrites that location with a poin- 
ter to a free buffer from its nool of free buffers. There- 
after the destination processor changes the owner- 
ship bit to once again indicate that the source proc- 

10 essor owns that location and the pointer to a free buf- 
fer stored therein. Accordingly, each destination proc- 
essor must make a free buffer available for a partic- 
ular source processor upon the reading of each mes- 
sage pointer. The destination processor can queue 

15 pointers to buffers containing messages forfuture ac- 
cess to the central buffer memory, to read the mes- 
sages at the input/output bandwidth of the destina- 
tion processor. 

Accordingly, a free buffer is made available by a 

20 destination processor to a source processor each 
time a pointer to a buffer containing a message is 
transferred to the destination processor. Moreover, 
the pointer exchange is completed by independent 
operation of each of the source and destination proc- 

25 essors, via the two bus transactions of the source 
processor to read and write to a respective dedicated 
ring memory and the separate polling/free pointer 
writing operation of the destination processor. Thus, 
there is no need to synchronize the processors to af- 

30 feet each free pointer/message pointer exchange and 
the exchange can be made in a bus independent op- 
eration without the need for read-modify-write or 
other interlocked bus transactions. 

The independent operation of the source and 

35 destination processors in the buffer swapping 
scheme can also be used to ease latency require- 
ments or further reduce bus bandwidth. The source 
processor can, for example, be operated to "read 
ahead" a number of locations in a buffer ring memory 

40 to acquire a preselected number of free buffers in a 
single bus transaction, A high performance source 
processor can thereby be prepared to send a burst of 
messages using the acquired number of free buffers. 
Similarly, the source processor can operate to write 

45 several message pointers into the number of loca- 
tions in the buffer ring memory in a single bus trans- 
action to further ease latencies and reduce bus band- 
width. In addition, the use of dedicated buffer ring 
memories at each potential destination processor. 

50 one for each potential source, enables any particular 
destination processor to automatically know where 
messages are coming from and to which source proc- 
essors it is returning free buffers. 

In one embodiment of the present invention, con- 

55 trol information, in addition to the ownership bit, is 
provided at each buffer ring memory location. The 
control information comprises a "beginning/continua- 
tion/last" marker field and. in certain pointers, a "read 
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permission" field. 

The "beginning/continuation/last" marker field is 
used to link buffers together so that messages too 
large to be stored in a single f bced size buffer can be 
stored into several logically contiguous buffers. The 
marker field of a buffer storing a first portion of a long 
message is set to indicate a "beginning" state, with 
subsequent buffers for subsequent portions of the 
message having their respective marker fields each 
set to a "continuation" state until the buffer storing the 
last portion of the message, which has its markerf ield 
set to a "last" state. In this manner, messages of vary- 
ing sizes can be transmitted between processors 
while maintaining a fixed size for each buffer. 

The fixed size buffer feature facilitates the buffer 
swapping sheme according to the present invention 
since each free buffer that is swapped for a message 
buffer is advantageously stored in a buffer ring mem- 
ory location prior to the writing of the message buffer, 
as described above. Thus, the use of a fixed size for 
all buffers insures that adequate buffer space is con- 
sistently returned during the buffer swapping opera- 
tion In other words, a free buffer of a size equal to any 
message buffer will always be swapped for a mes- 
sage buffer in the buffer exchange. By utilizing the 
"beginning/continuation/last" marker field, the fixed 
size for the buffers can be optimized for efficient 
memory allocation by fixing the size to be less than 
the maximum possible message size, as, e.g. a size 
that approximates the average message size. Yet the 
"beginning/continuation/last" marker field provides 
flexibility to link buffers together so as to accommo- 
date messages up to the maximum message size. 

The "read permission" field permits a source 
processor to transfer a message to several destina- 
tions in one message transfer operation. The inclu- 
sion of a read permission field in a pointer communi- 
cates to each intended destination processor that the 
buffer ring location contains a pointer to a message, 
but that the buffer ring location has read only access 
and the buffer is still owned by the source. The source 
processor maintains a table storing information on 
destination processors to Which a particular message 
is to be transferred. A"message read" communication 
is returned to the source processor by each intended 
destination processor upon reading the particular 
buffer ring location so that the source processor can 
progressively dear the table as "message read" com- 
munications are received. When all of the set of in- 
tended destination processors have read the mes- 
sage, the buffer is once again free for use with a pos- 
itive indication from the now cleared table that the 
message has been received by all of the intended 
destination processors. 

According to the invention there is provided a 
computer system as defined in daim 1 . The invention 
further provides a method for transferring a message 
from a first device to a second device in a computer 



system as recited in claim 7. Further, the invention 
provides a method of transferring a message from a 
first device to a second device in a computer system 
as defined in daim 8. 
5 Thus, the present invention provides a straight- 
forward scheme for transferring messages between 
processors. The scheme can be implemented in a bus 
independent transfer that assures a guaranteed re- 
turn of free buffers to each source processor. The si- 
te zes of the buffer ring memories at each destination 
processor are fixed so that the polling operation of a 
destination processor can be optimally timed to keep 
pace with the transfer of messages to the destination 
processor. In this manner, as each source processor 
15 moves around its dedicated buffer ring memories, a 
next location is always owned by the source proces- 
sor. As such, each source processor can also be ar- 
ranged to assert a malfunction alarm when the own- 
ership of a next buffer ring location read by the source 
20 processor is not owned by the source. Thus, the buf- 
fer swapping scheme of the present invention can be 
conveniently used to implement an effective alarm 
function so that appropriate action can be taken by a 
network manager to reconfigure the system as nec- 
25 essary to assure the free buffer swap. 

Brief Description of the Drawings 

Fig. 1 is a block diagram of a representative mul- 
30 tiprocessor computer system implementing the buffer 
swapping scheme of the present invention. 

Fig. 2 is a schematic representation of the buffer 
swapping scheme, as implemented in the computer 
system of Fig. 1 . 
35 Fig. 3 is a software block diagram of a source 

processor performing a buffer swap in respect of a 
message to be transmitted to a destination processor. 

Fig. 4 is a software block diagram of a destination 
processor performing a buffer swap in respect of a 
40 message transacted by a source processor. 

Fig. 5 is a block diagram of a pointer storage lo- 
cation. 

Fig. 6 is a block diagram of a mechanism for a 
message broadcast 
45 Fig. 6A is a block diagram for a pointer location 
used in a message broadcast. 

Detailed Description 

50 Referring now to the drawings, and initially to Fig. 
. 1, there is illustrated a multiprocessor computer sys- 
tem generally designated by the reference numeral 
10. In the illustrated system, a plurality of line cards 
12 is arranged so that each line card 12 is coupled to 

55 a backplane bus 14. A central buffer memory 16 is 
also coupled to the bus 14 so that the line cards 12 
can access the central buffer memory 16 to read and 
write messages that are to be communicated be- 
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tween the line cards 12. 

There are n line cards 12, designated 0 to n, 
which together provide a multi-protocol router/bridge 
in a computer network (not illustrated). Each lire card 
12 comprises a line interface 18, a processor 20, a 
bus interface 22 and a ring memory 24. 

Each line interface 18 is adapted to be coupled to 
a data transmission service, such as, e.g., Ti or 64 K 
bps services provided by long distance telecommuni- 
cation companies, or to a LAN to receive and transmit 
data packets from and to other components of the 
computer network. The line interfaces 18 are each 
configured to provide physical and data link level 
functbnality according to a network protocol imple- 
mented on the data transmission service to which the 
particular line interface 18 Is to be coupled. 

The processor 20 of each line card 12 performs 
routing or bridging processing in respect of data pack- 
ets received at or to be transmitted from the respec- 
tive line card 12. On the receive side, each processor 
20, e.g., examines the network address contained in 
the header of a received data packet to determine 
which line card 12 should be utilized to transmit the 
data packet for continued routing or bridging within 
the computer network. 

Thus, each data packet is a potential "message" 
to a processor 20 of another line card 12, ie., the re- 
ceived data packet must be transferred to the proc- 
essor 20 of the transmitting line card 12 for process- 
ing and eventual transmission via the line interface 1 8 
of the other line card 12. Accordingly, the multipro- 
cessor computer system 10 illustrated in Fig. 1 pro- 
vides an example of a high message traffic system. 
Each processor 20 will be primarily processing data 
packets that are received from other processors 20 or 
are to be communicated to other processors 20, to 
complete the transmission of data packets through 
the router or bridge provided by the multiprocessor 
computer system 1 0. 

To that end, each processor 20 will store each re- 
ceived message in a free buffer in the buffer memory 
16 via the respective bus interface 22 of the line card 
12. The bus interface 22 operates according to a bus 
protocol implemented on the bus 14 to acquire control 
of the bus 14 and thereafter perform a write operation 
into the selected buffer of the buffer memory 16 via 
the bus 14. According to the present invention, a poin- 
ter to the buffer is then transmitted to the processor 
20 of the transmitting line card 12 which utilizes the 
pointer to read the full data packet from the buffer 
memory 16 for network processing and eventual 
transmission. 

The ring memory 24 of each line card 1 2 is divid- 
ed into n buffer rings, one dedicated to each of the 
other line cards 12 of the system 10 and one dedicat- 
ed to the processor 20 of the same line card 12. (In 
the exemplary system, a data packet received by a 
line card 1 2 that is ultimately to be transmitted by the 



same line card 12, is sent through the same message 
forwarding scheme as other data packets from the 
line card back to the line card.) The ring memories 24 
are each used to store pointers from the other line 

5 cards 1 2 until the processor 20 of the line card 1 2 can 
perform a read bus transaction over the bus 14 to 
read the full data packet from the central memory 1 6. 

Referring now to Fig. 2, there is illustrated a sche- 
matic representation of the buffer swapping scheme 

10 according to the present invention. Each of RX1 and 
RX2 represents one of the line cards 12 when the re- 
spective line card is acting as a receiver of data pack- 
ets and Txa and Txb each represent one of the line 
cards 12 when the respective line card is acting as a 

15 transmitter of data packets. RXP represents a data 
packet receive module provided at each processor 20 
of a line card 12 to process each received data packet 
while TXP represents a data packet transmit module 
provided at each processor 20 of a line card 12 to 

20 process each data packet for transmission. 

As described above, each buffer ring memory 24 
is divided into n rings 24A, with each ring having a pre- 
selected number of memory locations 26 for the stor- 
age of pointers to buffers in the central memory 16. 

25 Moreover, each ring 24A is dedicated for use by one 
of the RXP modules, as illustrated by the lines 28. A 
FREE buffer queue 30 and a transmit (TXQ) buffer 
queue 32 is maintained by the TXP module of each 
tine card 12. 

30 When an RXP module needs to communicate a 

message (data packet) previously stored in a buffer 
of the central memory 16, to a particular TXP module, 
it will first read a location 26 of the dedicated ring 24A 
of the respective line card 12. Each RXP can, e.g., 

35 maintain a ring pointer for each dedicated buffer ring 
24A. Each ring pointer contains an address to a loca- 
tion 26 of the corresponding buffer ring 24Athat is to 
be read during a particular transaction. The RXP mod- 
ule will increment the address value after each trans- 

40 action to continuously traverse the ring buffer 24A. 

Referring for a moment to Fig. 5, each location 26 
includes a pointer value field 34 to store a pointer to 
a buffer of the central memory 16 and an ownership 
field 36 that comprises one bit representing either a 

45 set state or a clear state. The set state can represent 
ownership of the location 26 by the RXP module (the 
source processor) while the clear state can represent 
ownership of the location 26 by the TXP module (the 
destination processor). When the location 26 read by 

50 the RXP module is owned by that module, the pointer 
value 34 (See Fig. 5) is a pointer to a free buffer of the 
central memory 16. The RXP module will read the 
pointer value and store that value for use in a subse- 
quent message transfer, as will appear. 

55 The RXP module will then write the pointer value 

34 for the buffer where the message (data packet) is 
stored in the central memory 16 and change the state 
of the ownership field 36 to indicate ownership by the 
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TXP module of the line card 12. 

Each TXP module Is arranged to poll the loca- 
tions 26 of the rings 24A located at the respective tine 
card 12. When a location 26 is owned by the TXP 
module, the pointer value 36 stored at that location 26 
is read and loaded into the TXQ queue 32. At that 
time, the TXP module will also write a pointer value 
from the FREE queue 30 into the location 26 and 
change the ownership bit to once again indicate that 
the location 26 is owned by the RXP module. 

The TXP module will service each pointer loaded 
into the TXQ queue, as, e.g., on a FIFO basis, to ac- 
cess the central memory for each complete message. 
After the TXP module has read the complete mes- 
sage associated with a particular pointer value, it will 
transfer that pointer value to the FREE queue 30 for 
use in a future buffer swap operation. 

In this manner, buffers are automatically and con- 
tinuously swapped between line cards 12 to facilitate 
message communication with a guaranteed return 
supply of free buffers. Figs. 3 and 4 illustrate the 
above described buffer swapping scheme, in greater 
detail, in respect of source and destination message 
processing, respectively. As illustrated in Fig. 3, each 
complete message (data packet) is loaded into a DE- 
VICE queue 1 00. The RXP module reads the messag- 
es, one at a time, from the DEVICE queue 1 00 and ex- 
amines the header of each data packet to extract the 
network address, as is known in the art , for determi- 
nation of which TXP module is to receive the message 
(data packet) for transmission. 

To that end an ARE request comprising the net- 
work address, is transmitted to an address recogni- 
tion engine (not illustrated) for relevant network infor- 
mation including the identity of the TXP module to be 
used to transmit the data packet. The RXP module 
also writes the data packet to a buffer in the central 
memory 16. An RXP free buffer queue 102 stores a 
plurality of pointers to free buffers acquired from pre- 
vious buffer swaps. The RXP module will read a top 
free buffer pointer from the queue 102 and write the 
current data packet to the buffer pointed to by the 
pointer. The RXP free buffer queue 102 enables the 
RXP module to pipeline data packet processing, i.e. 
the RXP module can begin to process a subsequent 
data packet from the DEVICE queue 100 before the 
return of a free buffer in respect of an immediately 
preceding data packet(s) by storing several firee buf- 
fer pointers in the RXP free queue 102. 

As should be understood, the data packet is writ- 
ten to the central memory buffer pointer to by the 
pointer obtained from the RXP free buffer queue 102 
via a bus transaction over the bus 14 utilizing the bus 
interface 22 of the respective line card 12. The RXP 
module then writes the pointer value into a Pre-AD- 
DRESS queue 104. - 

RXBA is a sub-module of the RXP module ar- 
ranged to receive the ARE result and associate the 



result, including the identify of the TXP module to be 
used to transmit the data packet, with the pointer val- 
ue for the data packet central memory location. The 
associated pointer/TXP information is loaded by the 

5 RXBA module into an Rx BIN 106 for eventual input 
to an RxBD module. 

The RxBD module utilizes the input information 
to generate a descriptor containing network informa- 
tion relevant to the data packet. The descriptor is also 

10 written by the RxBD module to the buffer of the cen- 
tral memory where the data packet itself is stored, for 
use by the TXP module in transmit processing. The 
descriptor can contain any network processing infor- 
mation performed by the RxP module that is also 

15 needed in the transmit processing performed by the 
TXP module. The RxBD module passes each associ- 
ated pair of pointer/TXP information to a SOURCE 
queue 108. 

A SWAP module removes associated pairs of 

20 pointern"XP information from the SOURCE queue 
1 08 and performs the two bus transactions described 
above, over the bus 14, in respect of an owned loca- 
tion 26 of a corresponding ring 24Aof t he TXP module 
associated with a particular pointer in the poin- 

25 ter/TXP information. The free buffer obtained via the 
first bus transaction is loaded into the RXP free buffer 
queue 102 for use by the RXP module in subsequent 
message communications and the pointer obtained 
from the SOURCE queue 108 is written to the current 

30 location 26 of the corresponding ring 24A. The SWAP 
module also changes the ownership bit to indicate 
TXP ownership of the current location 26, 

Referring now to Fig, 4. a ring polling module, 
SCAV RINGS, at each processor 20 of a line card 1 2, 

35 polls each location of the n rings 24A associated with 
the line card 12 to locate owned locations 26, The 
SCAV RINGS module is responsible to read the poin- 
ter of each owned locatton 26 and thereafter write a 
pointer from the FREE queue 30 into that location 26. 

40 The SCAV RINGS module will also change the own- 
ership bit of the location 26 back to the RXP owner- 
ship state. 

As illustrated in Fig. 4, the TXQ queue 32 com- 
prises a set of destination bins 200 and a set of des- 

45 tination queues 202. Each destination bin 200 is dedi- 
cated to one of the rings 24A and each destination 
queue 202. in our representative embodiment of a 
multi-protocol router/bridge, is dedicated to a partic- 
ular network protocol of bridging protocol, e.g., the 

50 OSI and IP network protocols and BR representing a 
bridging protocol. The bin/queue set arrangement is 
used to segregate messages, first by source and then 
by protocol type. This feature of the present invention 
provides flexibility In message handling, in respect of 

55 the source of the message and the type of message, 
as will appear. 

Referring once again to Fig. 5, each location 26 
includes a "beginning/continuation/last" marker field 
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38. As described above, all of the buffers of the cen- 
tral mennory are of equal size to make certain that an 
equal amount of buffer space is swapped in each free 
buffer/message buffer exchange. The "begin- 
ning/continuation/last" marker field 38 provides a 
mechanism to link buffers together for communica- 
tion of messages too large to be stored in a single buf- 
fer. 

When an RXP module is communicating a mes- 
sage (data packet and associated descriptor) that is 
too large for a single buffer of the central memory 16, 
it will communicate the message in several buffers 
and link the buffers via the-"beginning/continuation- 
/last" marker field. The first portion of the message is 
placed in a first buffer and the RXP module sets the 
"beglnning/continuation/last" marker field of the poin- 
ter to the first buffer to indicate a "beginning" buffer. 
Buffers containing subsequent portions of the mes- 
sage are each stored in the central memory 16 with 
the respective marker fields of the respective poin- 
ters each set to indicate "continuation" until the last 
portion of the message. As should be understood, the 
RXP module will set the marker field for the buffer 
pointer containing the last portion of the message to 
indicate "last". The beginning, continuation and last 
buffer pointers will be stored in contiguous locations 
of the appropriate buffer ring 24A associated with the 
destination processor. 

However, to implement a fair polling operation in 
the SCAV RINGS modules of the processors 20, each 
SCAV RINGS module can be arranged, e.g.. to poll a 
location of a particular ring 24Aand move on to each 
other ring 24A before returning to a next location of 
the particular ring 24A. Thus, the SCAV RINGS mod- 
ule does not remove the pointers linked by their re- 
spective "beginning/continuation/last" marker fields 
in order The several destination bins 200 are used to 
acccomodate a fair polling operation and still maintain 
a contiguous relationship between linked buffer poin- 
ters transmitted by a particular RXP module. Each 
SCAV RINGS module loads each pointer read from a 
particular ring 24A into the bin corresponding to that 
ring 24Aso that all beginning, continuation, last buffer 
pointers remain contiguous as they move through the 
transmitting line card 12. 

A SCAV BINS module is used to transfer buffer 
pointers from the destination bins 200 to appropriate 
destination queues 202. As described above, each 
destination queue 202 is dedicated to a particular pro- 
tocol. As illustrated in Fig. 5, each location 26 is pro- 
vided with a TYPE field 40 which is used to indicate 
the protocol to be used for transmission of the data 
packet in the message pointed toby the pointer value. 
The RXP module inserts the protocol type in the 
TYPE field prior to the buffer swap operation. The 
SCAV BINS module examines the "beginning/contin- 
uation/last" markerf ield and TYPE field for each poin- 
ter removed from a particular bin 200 and loads all 



linked buffer pointers for a particular message into 
the destination queue corresponding to the protocol 
indicated In the TYPE field. 

The various destination queues are also used to 

5 introduce fairness into the system by providing a pre- 
selected limited size queue for each protocol type. A 
TxBS module renrroves pointers from the destination 
queues 202 in a straight round robin polling operation 
so that there Is progress in transmitting messages of 

10 each protocol type and no one protocol can prevent 
any other protocol from making progress in transmit- 
ting messages. The multiple destination queue ,202 
arrangement is useful because protocols have differ- 
ent characteristics such as different congestion algo- 

15 rithms. Certain congestion algorithms can block 
other protocols by dominating router operation if the 
round robin sequence through the various protocol 
types were not implemented. 

The TxBS module loads pointers removed from 

20 the destination queues 202 to a TRANSMIT PACKET 
queue 204. The TXP module of the line card 12 re- 
moves pointers from the TRANSMIT PACKET queue 
204, one at a time, and utilizes each pointer to access 
the central memory 16, via the bus 14. to read the 

25 header and body of a data packet and the associated 
descriptor (the message) for transmit processing. The 
data packet obtained from the central memory 16 is 
placed in a DEVICE queue 206 for eventual transmis- 
sion by the line interface 18 of the line card 12. In ad- 

30 dition. the TXP module loads the pointer just used to 
access a message into the FREE buffer queue 30 for 
use by the SCAV RINGS module in a future buffer 
swap, as described above. 

Referring now to Figs. 6 and 6A, there is illustrat- 

35 ed a mechanism and buffer pointer format for broad- 
casting a message to several destination processors. 
As illustrated in Fig. 6A, a broadcast pointer 126, for 
use in a broadcast, is provided with an ownership 
f ield 134, pointer value field 136, "beginning/continu- 

40 ation/last" marker field 138 and TYPE field 140, as in 
the buffer pointer 26 of Fig. 5. In addition, each broad- 
cast buffer pointer 126 is formatted to include a read 
permission field designated as a hook 142. An RXP 
module generates a hook value to identify a particular 

45 broadcast buffer pointer 126 for tracking the broad- 
cast buffer pointer 126. as will appear. 

As illustrated in Fig. 6, the RXP module broad- 
casting a message writes a broadcast pointer 126, 
containing the buffer pointer value for the buffer 

50 where the broadcast message is stored in the central 
memory 16 and hook value, to each destination proc- 
essor that is to receive the broadcast. A special 
broadcast queue 300 is provided at each destination 
processor for each potential broadcast source. Broad- 

55 cast buffer pointers 126 are written into the corre- 
sponding broadcast queues of each destination line 
card meant to receive the message. 

The SCAV RINGS modules of the destination line 
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cards will also poll the broadcast queues for messag- 
es. The ownership bit of each location is used as in 
the regular message forwarding scheme to indicate 
whether a pointer is to a message to be read by the 
SCAV RINGS module. However, the hookf ield 142 is 
interpreted by the destination SCAV RINGS modules 
to mean that it has read only access to the pointer lo- 
cation of the queue. After the SCAV RINGS module 
reads a broadcast queue location, it will set the own- 
ership bit to indicate that the source once again owns 
the location of the broadcast queue for future use. 

Each source line card maintains a table 302 of 
hook values including an entry 304 for each specific 
hook value used in a broadcast pointer 126. Each en- 
try 304 of the table 302 contains a set of bits, one for 
each potential destination processor. The RXP mod- 
ule sets only those bits at a particular entry 304 cor- 
responding to the destination processors to which it 
sent the broadcast pointer 126 having the hook value 
of the entry 304. 

The SCAV RINGS module of each destination 
processor receiving a broadcast message sends the 
hook number to the source corresponding to the 
broadcast queue from where it read the broadcast 
message. This can be accomplished via a pointer ex- 
change according to the present invention in the 
broadcast queues. For example, a SCAV RINGS 
module will write a pointer with a hook value from a 
previously read broadcast message sent by a partic- 
ular source into a location of the broadcast queue cor- 
responding to the source, after it reads the location 
for a current broadcast message. The SCAV RINGS 
module change the ownership bit to indicate that the 
particular source again owns the location. 

Upon reading each hook value during a buffer ex- 
change in a broadcast queue, the RXP module clears, 
in the entry 304 of the table 302 indicated by the hook 
value, the bit corresponding to the destination proc- 
essor sending the hook value. When ail of the bits for 
a hook value have been cleared, the source proces- 
sor knows all of the intended destination processors 
have received the broadcast message and the broad- 
cast pointer is once again free for use in another mes- 
sage transfer. 

The buffer swapping scheme of the present in- 
vention also provides for null pointers that can be 
used under certain circumstances. For example, a 
null pointer can be returned by a destination proces- 
sor when it does not have pointers to free buffers for 
a temporary period, yet it is desirable to continue to 
make progress around a buffer ring memory until free 
buffers become available. This would be an exception 
to the true buffer svrap mechanism of the present in- 
vention, but provides a mechanism to handle tempor- 
ary shortages of free buffer pointers at a particular 
destination processor. The null pointers car also be 
used at initialization of the system to populate the free 
buffer pools via an exchange of free buffer pointers 



for null pointers between a management processor 
and the operating processors of the system via the 
buffer ring memories or to re-balance the number of 
buffers between the source and destination proces- 
5 sors. 

As described generally above, a SWAP module 
can be arranged to assert an alarm whenever a next 
location of a 

10 Effect of the Invention 

Described hereinabove is an improved message 
transfer system for transferring a message from a 
source processor to a destination processor. Buffers 

15 , have been used in between a source processor and 
a destination professor to prevent input/output band 
width crunches at the destination processor. In prior 
art where message transfers were done using a buf- 
fer memory for message transfer, it was necessary to 

20 synchronize the source and destination processors. 
- This is a disadvantage, and the present Invention ob- 
viates the disadvantage by using a buffer swapping 
system, whereby each time a message transfer from 
the source to a destination processor occurs, a free 

25 buffer is made available to the source processor. 

The System 10 has several line cards wherein 
each line card 12 comprises a line internee 18, a 
processor 20, a bus interface 22 and a ring memory 
24. Read/write messages are required to be commu- 

30 nicated between line cards 12, i.e., the processors 20 
which are part of the time cards. 

The present invention teaches a buffer (16) 
swapping scheme wherein a pointer to a free buffer 
is returned to the source processor by a destination 

35 processor each time a message transfer has to occur. 
Thus, a supply of freely available buffer is ensured. 

Each location 26 owned by a source processor 20 
includes a pointer value field 34 to store a pointer to 
a buffer in the central memory 16. A TXP module 

40 (destination processor) is used to transmit the data 
packet to a buffer in the central memory. The above 
system operates efficiently to transfer read/write 
messages from a source processor to a destination 
processor without need to synchronize the two, and 

45 without bandwidth crunches at the destination proc- 
essor. 

Claims 

50 

1 . A computer system (1 0), which comprises: 
a first device (20); 
a second device (20); 
a common memory(16) comprising a plur- 
55 ality of buffers for transferring messages from 

the first device to the second device; 

each of the first device (20) and the sec- 
ond device (20) being coupled to the common 
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memory (16); 

a pointer memory (34) coupled to each of 
the first device and the second device, the poin- 
ter memory having a set of locations (26) for stor- 
ing pointers to buffers of the common memory, 5 
preselected ones of the locations each storing a 
pointer to a buffer of the common memory that is 
available for use to store a message; and 

a pool of pointers (126) to buffers of the 
common memory that are available for use to io 
store messages; 

the first device storing a pointer to at least 
one of the plurality of buffers and operating to 
transfer a message to the second device by writ- 
ing the message to the one of the plurality of buf- 15 
f ers, reading a pointer to a buffer available for use 
from a preselected location of the pointer mem- 
ory and writing the pointer to the one of the plur- 
ality of buffers to the preselected location of the 
pointer memory; 20 

the second device operating to read the 
pointer to the one of the plurality of buffers from 
the preselected location of the pointer memory 
and to thereafter remove a pointer to a buffer 
available for use from the pool of pointers and to 25 
write the removed pointer to the preselected lo- 
cation of the pointer memory. 

2. The computer system of claim 1 , wherein each lo- 
cation of the pointer memory includes an owner- 30 
ship field having a first state and a second state, 

the first state indicating that the corresponding 
location stores a pointer to a buffer that is valid 
for use by the first device to store a message to 
the second device and the second state indicat- 35 
ing that the corresponding location stores a poin- 
ter to a buffer valid for use by the second device 
to read a message at the buffer of the common 
memory pointed to by the pointer. 

40 

3. The computer system of claim 2. wherein the first 
device operates to read and write pointers from 
and to a location only when the ownership field of 
the location is in the first state, wherein the first 
device changes the state of a location from the 45 
first state to the second state after reading a poin- 
ter to a buffer available for use from the location 

and writing the pointer to the one of the plurality 
of buffers to the location of the pointer memory. 

50 

4. The computer system of claim 2, wherein the sec- 
ond device operates to read and write pointers 
from and to a location only when the ownership 
field of the location is in the second state, where- 
in the second device changes the state of a loca- 55 
tion from the second state to the first state after 
reading a pointer to the one of the plurality of buf- 
fers from the location, removing a pointer to a buf- 



fer available for use from the pool of pointers and 
writing the removed pointer to the location. 

5. The computer system of claim 1 , wherein each of 
the buffers of the common memory are equal in 
memory size, wherein each location of the poin- 
ter memory includes a "beginning/continuation- 
/last" marker field to link equal size buffers point- 
ed to by pointers stored in respective locations so 
that a message of a size greater than the size of 
each buffer can be stored in buffers linked by 
"beginning/continuation/last" marker fields, 
wherein the 

"beginning/continuation/last" marker field of a 
first location storing a pointer to a buffer storing 
a first portion of a message is set to a "beginning" 
state and the 

"beginningycontinuation/last" marker field of 
each of n locations storing pointers to buffers 
storing n subsequent portions of the message Is 
set to a "continuation" state and the "begin- 
ning/continuation/Iasr marker field of a last loca- 
tion storing a pointer to a buffer storing an end 
portion of the message Is set to a "last" state. 

6. The computer system of daim 1 , wherein the sec- 
ond device operates to utilize the pointer to the 
one of the plurality of buffers to read the mes- 
sage stored in the one of the plurality of buffers 
by the first device and after reading the mes- 
sage, to place the pointer in the pool of pointers. 

7. A method for transferring a message from a first 
device to a second device in a computer system, 
comprising the steps of: 

providing a common memory comprising a 
plurality of buffers for transferring messages 
from the first device to the second device; 

providing a pool of pointers to buffers of 
the common memory available for use to store 
messages; 

operating the first device to store a mes- 
sage in a preselected buffer of the common mem- 
ory; 

operating the first device to pass a pointer 
to the preselected buffer to the second device; 
and 

operating the second device to utilize the 
pointer to the preselected buffer to access the 
preselected buffer for the message stored there- 
in; and 

operating the second device to place the 
pointer in the pool of pointers upon accessing the 
preselected buffer. 

8. A method for transferring a message from a first 
device to a second device in a computer system, 
comprising the steps of: 
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providing a common memory comprising a 
plurality of buffers for storing messages from the 
first device to the second devices; 

providing a pool of pointers to buffers of 
the common memory available for use to store 5 
messages; 

operating the first device to store a mes- 
sage in a preselected buffer of the common mem- 
ory; 

operating the first device to pass a pointer io 
to the preselected buffer to the second device; 
and 

operating the second device to utilize the 
pointer to the preselected buffer to access the 
preselected buffer for the message stored there- is 
in and to affect a transfer of a pointer from the 
pool of pointers to the first device a condition for 
utilizing the pointer to the preselected buffer. 

9. The method of claim 8 comprising the further 20 
steps of: 

providing a pointer memory having loca- 
tions for storing pointers to buffers of the common 
memory, certain ones of the locations each stor- 
ing a pointer to a buffer available for use to store 25 
a message; and 

performing the step of operating the first 
device to pass a pointer by operating the first de- 
vice to write the pointer into the one of the certain 
ones of the locations after reading that location 30 
for a pointer to a buffer available for use to store 
a message. 

10, The method of claim 9 comprising the further 
step of: 35 

operating the second device to read the 
one of the certain ones of the locations for the 
pointer to the preselected buffer and to write a 
pointerfrom the pool of pointers into that location 
upon and as a condition for reading the F>ointer to 40 
the preselected buffer 
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